Lattice QCD with Domain Decomposition on Intel

Authors

  • Simon Heybrock
  • Bálint Joó
  • Dhiraj D. Kalamkar
  • Mikhail Smelyanskiy
  • Karthikeyan Vaidyanathan
  • Tilo Wettig
  • Pradeep Dubey

Abstract

The gap between the cost of moving data and the cost of computing continues to grow, making it ever harder to design iterative solvers on extreme-scale architectures. This problem can be alleviated by alternative algorithms that reduce the amount of data movement. We investigate this in the context of Lattice Quantum Chromodynamics and implement such an alternative solver algorithm, based on domain decomposition, on Intel® Xeon Phi co-processor (KNC) clusters. We demonstrate close-to-linear on-chip scaling to all 60 cores of the KNC. With a mix of single- and half-precision the domain-decomposition method sustains 400-500 Gflop/s per chip. Compared to an optimized KNC implementation of a standard solver [1], our full multi-node domain-decomposition solver strong-scales to more nodes and reduces the time-to-solution by a factor of 5.

Keywords: Domain decomposition, Intel® Xeon Phi coprocessor, Lattice QCD

Categories and subject descriptors: D.3.4 [Programming Languages]: Processors – Optimization; G.1.3 [Numerical Analysis]: Numerical Linear Algebra – Sparse, structured, and very large systems (direct and iterative methods); G.4 [Mathematical Software]: Algorithm design and analysis, Efficiency, Parallel and vector implementations; J.2 [Physical Sciences and Engineering]: Physics

General Terms: Algorithms, Performance

Similar resources

Solution of the Dirac equation in lattice QCD using a domain decomposition method

Efficient algorithms for the solution of partial differential equations on parallel computers are often based on domain decomposition methods. Schwarz preconditioners combined with standard Krylov space solvers are widely used in this context, and such a combination is shown here to perform very well in the case of the Wilson–Dirac equation in lattice QCD. In particular, with respect to even-od...

HMC algorithm for two-flavour lattice QCD: Schwarz-preconditioning with a one-dimensional domain decomposition

We study a variant of the Schwarz-preconditioned HMC algorithm. In contrast to the original proposal of Lüscher, we apply the domain decomposition in one lattice direction only. This is sufficient to reduce the condition number of the fermion matrix restricted to the domains compared with the full fermion matrix. For the same linear extension of the domain, fewer links reside on the boundaries o...

Data Envelopment Analysis from simulation on the Lattice QCD using CCR model

One of the most important principles in the economic theory of production is the principle of "efficiency". Simply put, efficiency can be defined as the demand that the desired goals (outputs) are achieved with the minimum use of the available resources (inputs). In order to distinguish the relative efficiency of organizational units with multiple inputs to produce multiple outputs, "Data Envelopmen...

A domain decomposition algorithm for improved staggered fermions on GPUs

Lattice QCD is a numerical approach to the theory of the strong interaction. Calculations in this field answer fundamental questions about the nature of matter, provide insight into the evolution of the early universe, and play a crucial role in the search for new theories beyond the Standard Model of elementary particle physics. Massive computational resources are needed to achieve these goals...

Determination of the Δ(1232) axial and pseudoscalar form factors from lattice QCD

We present a lattice QCD calculation of the Δ(1232) matrix elements of the axial-vector and pseudoscalar currents. ...


Publication date: 2014